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Dynamical mean-field theory and its cluster extensions provide a very useful approach for ex- 
amining phase transitions in model Hamiltonians, and, in combination with electronic structure 
theory, constitute powerful methods to treat strongly correlated materials. The key advantage to 
the technique is that, unlike competing real-space methods, the sign problem is well controlled in 
the Hirsch-Fye (HF) quantum Monte Carlo used as an exact cluster solver. However, an important 
computational bottleneck remains; the HF method scales as the cube of the inverse temperature, 
13. This often makes simulations at low temperatures extremely challenging. We present here a new 
method based on determinant quantum Monte Carlo which scales linearly in j3, with a quadratic 
term that comes in to play for the number of time slices larger than hundred, and demonstrate that 
the sign problem is identical to HF. 
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INTRODUCTION 



Quantum Monte Carlo (QMC) methods provide an im- 
portant methodology for solving for the properties of in- 
teracting Fermi systems. In auxiliary field techniques 
0], the partition function, Z = Ti exp[—f3H] is expressed 
as a path integral, for example, by discretizing the imagi- 
nary time /? into L intervals of length At, and separating 
the one body (kinetic) and two-body (interaction) terms. 
The latter are then decoupled through the introduction 
of a Hirsch-Hubbard-Stratonovich (HHS) field which 
reduces the problem to a quadratic form. The fermion 
degrees of freedom can be integrated out analytically, 
leaving an expression for the partition function which 
is a sum over the possible configurations of the auxil- 
iary field. For interacting lattice Hamiltonians, such as 
the Hubbard model, this field depends both upon the 
spatial site and on the imaginary time coordinate. The 
sum over configurations is performed stochastically, for 
example, by suggesting local changes and accepting or 
rejecting with the Metropolis algorithm. The problem is 
challenging numerically because the summand is the de- 
terminant of a product of matrices, one for each fermion 
species. The determinant is costly to evaluate, and can 
also become negative at low temperatures, which consti- 
tutes the fermion sign problem 8]. 

There are different ways to represent the matrices. 
In the determinant quantum Monte Carlo (DQMC) ap- 
proach |l| , the matrices have dimension equal to the num- 
ber of spatial lattice sites Nc. The matrices are dense, 
and involve the product of L sparse matrices. The al- 
gorithm scaling, N^L, arises from the need to update 
NcL field variables at a cost of per update, where 
advantage is taken of an identity for the inverse and de- 
terminants of A^c-dimensional matrices which differ only 
by a rank-one change. Simulations with this method 
can now be done on many hundreds of spatial sites. In 



situations where particle-hole symmetry prevents a sign 
problem, for example, the half-filled Hubbard Hamilto- 
nian, one can reach arbitrarily low temperatures. DQMC 
simulations have proven the existence of long-range an- 
tiferromagnetic order in the two-dimensional half-filled 
Hubbard model 0, as well as accurately determined 
the nature of the spectral function and thermodynamic 
properties at this density [l^, [lH ■ 

Alternatively, in the algorithm developed by Hirsch and 
Fye (HF) [l2l| for embedded-cluster problems, a larger, 
sparse matrix of dimension NcL is considered. The ad- 
vantage of the HF-QMC approach is that the matrices 
are better conditioned (no product of L matrices is in- 
volved) and also they remain positive to much lower tem- 
peratures; the sign problem is far less severe in the HF- 
QMC method. However, because determinants of larger 
matrices are involved, the HF-QMC algorithm scales as 
N^L^. For this reason, HF-QMC has seen its most 
powerful applications within dynamical mean-field the- 
ory (DMFT) [H, [ill and its cluster extensions, the dy- 
namical cluster approximation (DC A) [l^ , and the cellu- 
lar dynamical mean-field theory (CDMFT) [l^ for which 
Nc is typically small. In effect, DMFT trades the large 
lattice sizes Nc, and Nc scaling of DQMC where spatial 
correlations can be explored, for the ability to reach much 
lower temperatures at general fillings at the cost of less 
real-space information, apart from that obtained from 
the mean field. DMFT also can directly access phase 
transitions which can only be inferred from finite-size 
scaling in DQMC. 

In this paper, we describe a hybrid approach which 
combines some of the virtues of both DQMC and HF- 
QMC. The key algorithmic improvement is a reduction in 
the HF-QMC scaling to linear in L. The importance 
is that this allows much larger Nc to be considered. At 
the same time, we demonstrate analytically (and confirm 
numerically) that the fermion sign problem in our hybrid 
algorithm is precisely the same as in HF-QMC, provided 
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that the coupHng to the host is fully taken into account. 
Thus, as in HF-QMC, we can reach low temperatures 
at quite general fillings. Our paper is organized as fol- 
lows. We first introduce the basic formalism, including a 
proof that the sign problem is unchanged from HF-QMC. 
We then show results for various physical observables in- 
cluding the quasi-particle weight, local moment, and the 
Green's function. We demonstrate that the results of 
our algorithm converge to the same values as that of a 
well-developed and tested HF-QMC code. We conclude 
with a comparison of the scaling properties of our new 
approach. 



FORMALISM 

DMFT, DC A, and other cluster extensions such as the 
CDMFT all map the lattice problem onto an effective 
cluster embedded in a self-consistently determined effec- 
tive medium. Here, we will add additional sites to the 
cluster to emulate the effective medium [13, [3 • The as- 
sociated formalism will be sketched for the DMFT and 
DCA, but it is easily extendable to include CDMFT. 

The DCA is a cluster mean-field theory which maps 
the original Z?— dimensional lattice model onto a periodic 
cluster of size Nc = embedded in a self-consistent 
host. This mapping is accomplished by replacing the 
Green's function and interaction used to calculate irre- 
ducible quantities such as the self-energy (S) by their 
coarse-grained analogs. Spatial correlations up to a range 
Lc are treated explicitly, while those at longer length 
scales are described at the mean-field level. For details 
of the DCA formalism and algorithm, please see Ref. [l^ . 

The DCA loop converges when the cluster Green's 
function equals the coarse-grained Green's function, 
Gc = G, 



-I- 



G(K,iw„) 



Nt ^ iuJn - Ek+K - iuJn) 



(1) 



iujn - ck - S(K, iujn) - r(K, iujn) ' 

where K labels a cluster wave number, aj„ is the Mat- 
subara frequency, k labels the lattice wave numbers in 
the Wigner-Seitz cell surrounding K, and Nt is the to- 
tal number of lattice sites. eK — Nc/Nt J^w ^ic+K 
coarse-grained dispersion and F is the single-particle hy- 
bridization between the DCA cluster and its effective 
medium. 

Here, we consider the two-dimensional (2D) single- 
band Hubbard model [s^. In order to employ DQMC 
as a cluster solver, we define an effective cluster Hamil- 
tonian to preserve the coarse-grained Green's function 
through the addition of host band degrees of freedom, 
which we label with d". 



(2) 



K.cj.a 



''kCk,o-"K,, 



H.c. 



The host band label, a, runs from 1 to N^- e"(K) is the 
dispersion for the band, is the coupling of the 
band to the c band, U is the strength of the interaction 
and Tiicr = cl^Cia is the number of spin-tr electrons on site 
i. Upon integration of the d— band degrees of freedom, 
the correlated band Green's function becomes 



Geff(K, iiVn) 



1 



iuJn - e(K) - i;(K, iw„) - r'(K, ia;„) ' 



where 



r'(K,zc^„) = 



No, 
a=l 



Kl 



6"(K) 



(3) 



(4) 



The parameters V-^ and e"(K) are adjusted to fit the 
DCA or DMFT hybridization function r'(K,iwJ « 
r(K,iw„). For this, we use Marquardt's method [20| to 
minimize the following merit function at each momentum 
point: 



X'{K) 



E 



r(K,iw„)-r'(K,zw„)r 



We define the scaled deviation as 



77(K) 



X(K) 

e(K) 



(5) 



(6) 



K,(T 



where ^ is the standard deviation of data. 

The discretization of the bath degrees of freedom 
has been considered in DMFT where exact diagonal- 
ization (ED) is used as the Hamiltonian-based impu- 
rity solver [17|] . Extensions of this method to dynamical 
cluster mean-field theories have also been largely imple- 
mented to study variety of models such as the extended 
Hubbard or multiband models 21-23]. The advantage of 
this method is that since ED is essentially exact, there 
is no systematic error beyond the discretization of the 
bath. Moreover, more complicated interactions than just 
the onsite Coulomb can be easily included in the Hamil- 
tonian. However, the disadvantage of ED is that the 
Hilbert space grows exponentially with the total size of 
the system, Nc{l-\-Na)- This greatly limits the size of the 
clusters that can be studied. This is specially true since 
(as we discuss below) smaller clusters generally require a 
higher number of non-interacting bands to fully account 
for the coupling to the bath, and for larger clusters, e.g., 
the 16-site cluster, even a very small Na{— 2) will make 
ED inapplicable. 



QMC ALGORITHMS AND THE SIGN PROBLEM 

The average sign in the DQMC method is equivalent 
to the average sign in the HF-QMC method in the limit 
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of infinite number of bath bands, Na oo. To prove 
this, we use the path-integral formahsm and write the 
partition function as 



'5'(7,7*) 



(7) 



where 2?[..] denotes the functional integral, S is the 
action, and 7 and 7* are Grassmann variable vectors. 
Equation ([7]) can be approximated by 



E 

s.,l=±l 



P[7]P[7*]e-^°('^'^*)e-5^(T-'^*) (g) 



where 'S'(o)/ is the (non)interacting part of the action and 
7c and 7* represent the c— band components. In Eq. 

we have used HHS transformation to decouple the 
correlation in the interacting part of the action. 



i,l^c I,/ — l,cr 



(9) 



Here, cosh(A) — e^"^*^/^, Sij is the auxiliary field and I 
is the time index so that t; — IAt — l(3/L. The non- 
interacting part of the action has the following form: 



5'o(7,7*) = E 



'y7n ,1.(7 '^m .1—1,(7 

a7 



(10) 



where Hq is the non-interacting part of the Hamiltonian 
and m denotes both the spacial coordinate and the band 
index (including the c band). Equation ([S]) becomes ex- 
act in the limit of At ^ 0. By integrating out all the 
Grassmann variables in Eq. ([8]), one obtains the following 
expression: 



Z oc Tr^^ . J J det[G^ 



det[G^i] 



(11) 



where Ga is the Green's function of size NL with N — 
iVc + 7VciV„. 

In the DQMC algorithm. Ho- det[Gg:^] is used as the 
sampling weight to complete the sum over the auxiliary 
field. Note that the action is off-diagonal in time, except 
for the first term of the non-interacting action which is 
equal to one along the diagonal [see Eq. ([TU])]. There- 
fore, is an off-diagonal sparse matrix with identity 
matrices along the diagonal and its determinant can be 
evaluated from a smaller matrix of size N, using the fol- 
lowing identity: 

det[G;i] = det[/ + B.^lB^^l-i ■ ■ ■ S,,2S,,i] (12) 

where B^ i is the corresponding off-diagonal sub-matrix 
of G~^ at time slice I. The DQMC Markov process 
proceeds by proposing changes in the HHS fields which 
are local in space and time, Si^i — >■ —Si^i. Because of 



that, the ratio of the fermion determinants can be calcu- 
lated directly from just the diagonal entry of the Green's 
function. Similarly, the update of the Green's func- 
tion following an accepted move does not require a full 
0{N^) matrix inversion, but can be done in 0{N'^) op- 
erations. More details about this algorithm can be found 
in Ref. 

Now suppose that instead of integrating out all the 
Grassmann variables in Eq. ([8|) , we integrate out only the 
ones associated with the non-interacting electron bands. 
The partition function can then be written as 



=±1 



Z cx ^ / 2?[7c]2?[7c1e- 



(13) 



where 

S'c(7c,7c*)= E ^cid,aG~^{hl'J^l'hc3,l',a + Sl{'yc,lc)- 

(14) 

In the above equation, G is the non-interacting Green's 
function on the cluster {Q~^ = + S) whose Fourier 
transform to momentum and frequency space can be 
written as 



(15) 



Z (X Tr{,, J det[G;:/] det[G;/] 



In the limit of an infinite number of non-interacting 
host bands, — > 00, the self-consistent DCA hybridiza- 
tion function may be exactly represented by the analytic 
form of Eq. ©, r'(K,iw„) = r(K,iw„). Therefore, g 
will be equal to the DCA cluster-excluded Green's func- 
tion, (G~^ -I- 1])~^. By integrating out the rest of Grass- 
mann variables in Eq. (1131) , the partition function reads 

(16) 

where Gc is the DCA cluster Green's function of size 
A^c^. 

In HF-QMC, to complete the sum over the auxiliary 
field. Ho- det G~^ is used as the sampling weight. Un- 
like DQMC, where the inverse Green's function is sparse, 
here G^^ is a dense matrix with a dimension that grows 
with the number of time slices. The HF-QMC Markov 
process proceeds by proposing local changes in the HHS 
fields, Si^i — )• —Si^i. The cost to propose a change, i.e., 
to calculate the ratio of determinants [Eq. (IT51) ]. is low 
and does not depend upon L or Nc- If a change is ac- 
cepted, then the cluster Green's function matrix Gc must 
be updated. It is possible to write this step as a rank- 
one matrix update. However, since the inverse Green's 
function matrix is dense, it is not possible to decompose 
it into A'^c X A'c blocks similar to what was done above 
with DQMC. 

By comparing Eqs. (|lip and (|16p . one can write the 
following equation for a particular field configuration: 

Gdet[G7^^] det[G;"/] = det[Gf ^] det[G^^]. (17) 

Since C is independent of fields, the ratio of sampling 
weights will be the same and therefore, the measured 
quantities, including the average sign, will have the same 
statistics in DQMC and HF-QMC algorithms. 
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FIG. 1: (Color online) (a) The imaginary part of the DMFT 
hybridization function and fits to its analytic form of the ef- 
fective cluster problem [Eq. Q] for A^^ = 1, 2, and 3 versus 
Matsubara frequency, (b) The corresponding scaled devia- 
tions of the fits [Eq. ((B))] versus temperature. 
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RESULTS 

We apply this method to the 2D Hubbard model [Eq. 
([3])] on a square lattice with nearest-neighbor hopping, i, 
and show results for Art = 1/4 and the interaction equal 
to three quarters of the bandwidth (U = 6t) at filling, 
(n) = 0.86, throughout this paper; calculations at differ- 
ent doping regions and for interaction strength equal to 
the bandwidth lead to the same trends for the quantities 
discussed in this work [SG^ . The quality of the fit of the 
effective cluster hybridization function [Eq. to the 
DC A or DMFT hybridization function, F, is improved 
by increasing the number of non-interacting bath bands. 
In Fig. [IJa), we show the imaginary part of F(iaj„) and 
the corresponding data for F'(ia;„) from the fitting algo- 
rithm using different values of for a single impurity 
problem (DMFT). The improved quality of the fit at a 
low temperature (T — 0.12t) can be seen as Na increases 
from 1 to 3. We find that for a finite A^q, the quality 
of the fit always decreases as the temperature is lowered. 
This can be seen in Fig. [Ijb) where we show the scaled 
deviation of the fit [Eq. ^\ for different values of as 
a function of temperature. The hybridization function is 
poorly fit for = 1 even at high temperatures. How- 
ever, the scaled deviation is strongly reduced when Na 
increases. 

As the number of bath degrees of freedom increases, 
DQMC recovers the HF-QMC results for a single-site 
problem. We find that a maximum of four bath 
bands are sufficient for the agreement of the two meth- 
ods at temperatures as low as T = 0.07t. This 
convergence is shown in Fig. [5] for Na < 3 where 
we plot the Matsubara frequency quasi-particle weight 
(Zo(K) = [l-ImE(K,i7rr)/7rT]~i), local moment {^J? = 
{{ria — n^cr)'^'^) and the Green's function, calculated us- 
ing HF-QMC and DQMC solvers. To have an idea about 
the absolute errors, we have also included results from 
an exact solution, i.e., HF-QMC with a very small At 
(= l/Wt). We point out that the average fermion sign, 
shown in Fig.[5][c), is equal to one, regardless of the bath 
in the single-site limit. 



FIG. 2: (Color online) The convergence of DQMC to 
HF-QMC by increasing A'^^ for a single impurity problem 
(DMFT). We plot (a) the Matsubara frequency quasi-particle 
fraction versus temperature, (b) the unscreened moment ver- 
sus temperature, (c) the average sign versus inverse tempera- 
ture and (d) the Green's function at a low temperature versus 
imaginary time, calculated using HF-QMC and DQMC as im- 
purity solvers. For comparison to exact results, a HF-QMC 
solution with very small At is also presented. For DQMC, we 
show results for A^^ = 1, 2, and 3. For a single-site problem, 
the average sign is exactly one in all cases. The statistical 
error bars are smaller than the symbols and are not shown. 



The DQMC is a well-behaved cluster solver for the 
DCA as the number of bath bands needed to recover the 
HF-QMC results decreases with increasing cluster size. 
This can be understood from the suppression of the cou- 
pling between cluster and host degrees of freedom. In 
fact, it was shown previously that the hybridization func- 
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FIG. 3: (Color online) Same as Fig. [2] for a 2 x 2 cluster in 
the DCA. In (a) and (d), we plot the quasi-particle fraction 
at K = (tt, 0) and the Green's function at the origin, respec- 
tively. Results for A'^^ = 1 cannot be obtained due to a bad 
sign, even at relatively high temperatures. 
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FIG. 4: (Color online) Same as Fig. |3] for a 4 x 4 cluster. 
For this duster, the convergence of DQMC to HF-QMC is 
achieved with No. = 2. In (c), we also show the average sign 
for a finite-size (FS) DQMC calculation on this cluster using 
the same model parameters. 

tion in the DC A is of order OiXjNc ), where D is the 
dimensionality j2j]. To illustrate that, we plot in Fig. [3] 
the same quantities of Fig. [2] using the same model pa- 
rameters but now calculated on a 2 x 2 cluster. For this 
cluster, the DQMC results show very good agreement 
with those of HF-QMC up to /3t = 34 when = 3. 
As proven in the previous section, the average sign in 
DQMC converges to its HF-QMC value by increasing Na 
[see Fig.[3jc)]. We find that the sign shows a strong sen- 
sitivity to the quality of the hybridization function fit. 
Thus, when iVc > 1, results for Na = 1 can not be ob- 
tained due to a bad sign problem, even at relatively high 
temperatures. In Figs.[3l^a) and[3l^d), we show the quasi- 
particle fraction at K = (tt, 0) and the Green's function 
at the origin in real space, respectively. 

The DQMC cluster solver is best suited for larger clus- 
ter simulations where = 2 is sufficient to recover the 
HF-QMC results. As an example, we present results for 
a 4 X 4 cluster in Fig. |4l We find excellent agreement be- 
tween HF-QMC and DQMC calculations when Na = 2. 
Here, the average sign falls more rapidly by decreasing 
temperature than that of the 2x2 cluster [see Fig. IHc)] . 
This limits the calculations for this cluster to pt < 15 in 
the optimally doped region. However, as can be seen in 
Fig. HJc), the average sign is significantly improved from 
a finite-size DQMC calculation. 

As in HF-QMC, analytic continuation can be per- 
formed to calculate real-frequency quantities when 
DQMC is used as the cluster solver. As an example, 
we have considered the case of Fig. 2] and calculated the 
single-particle density of states (DOS) using the maxi- 
mum entropy method [25| . The results indicate that dis- 
cretizing the bath degrees of freedom does not have a sig- 
nificant influence on the spectra. A comparison between 
HF-QMC and DQMC DOS has been presented in Fig. [S] 
where we find that there is a very good agreement be- 



FIG. 5: (Color online) Density of states for the case study of 
Fig- SI The solid (dashed) hne shows the results for HF-QMC 
(DQMC with iVc = 2). 



tween the two density of states in the low energy region. 
However, there is a slight difference in the high-energy 
region which would presumably vanish by increasing Na ■ 



SCALING 

As discussed in previous sections, the linear scaling of 
the DQMC algorithm with the number of time slices is 
the main advantage of this cluster solver over HF-QMC. 
The updating process in HF-QMC, which is the most ex- 
pensive step in this algorithm, scales like (NcL)^. This 
is a results of 0{NcL) changes in the field variable dur- 
ing each sweep and 0{N^L^) operations to update the 
Green's function for each change, using a rank-one updat- 
ing mechanism. A similar argument applies to the scaling 
in the DQMC, except that it costs 0{{Nc + NcNa)^) to 
update the inverse Green's function after each change in 
the field variable. Since the number of HHS fields and 
therefore, the number of such updates is proportional to 
L, the overall scaling of updates in DQMC is linear in L. 
The scaling in the system size remains cubic as in other 
QMC methods and is a big advantage over ED which 
scales exponentially in the size. To show the linear be- 
havior in L, we plot the CPU time for updates versus 
L on the 4x4 cluster in Fig. [5] (a). First, we compare 
this to that of HF-QMC for the same model parameters 
and by setting f3t = 2.5. At this fixed /3, the product 
of matrices in DQMC is stable, which results in a per- 
fectly linear scaling. We find that the updating step in 
DQMC is up to three orders of magnitude faster than in 
HF-QMC for a large number of time slices {L ~ 200). 

In more realistic simulations, increasing L is a conse- 
quence of increasing /3 to access low temperatures for a 
fixed order of systematic error (constant At) [H, l27j . 
In this case, we do not expect to see any change in the 
scaling of HF-QMC. However, in DQMC, an orthogo- 
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nalization step which scales as L^, has to be performed 
to avoid the round-ofF errors. To show how the DQMC 
scaling changes, we also plot in Fig. |6l[a), the CPU time 
for DQMC with Art ==1/16. We see that the orthogo- 
nalization step introduces a quadratic term in L with a 
coefficient which is two orders of magnitude smaller than 
the coefficient of the linear term [see diamond symbols 
in Fig. HJa)]. This effect on the performance of the algo- 
rithm becomes slowly significant only when L > 100. We 
point out that measuring the Green's function in DQMC 
involves matrix multiplications of the same type as in the 
updating process, and therefore results in the scaling of 
the CPU time that is very similar to the one for the up- 
dates. However, as can be inferred from Fig. |6l^b), mea- 
surements generally take more time than updates and 
the quadratic term appears even in the case of constant 
/3. The time for measuring the Green's function in HF- 
QMC has more or less the same scaling as in DQMC, but 
is roughly an order of magnitude larger when L 200. 



DISCUSSION 

In this paper we have shown that the use of DQMC 
as a cluster solver provides several order of magnitude 
speedup over the HF-QMC algorithm, with a sign prob- 
lem which is well behaved (identical to HF-QMC). This 
improvement arises from a fundamental reduction in the 
scaling of the algorithm, from cubic in the inverse tem- 
perature, /3, to linear in /? (with a small quadratic term 
arising from matrix orthogonalization to reduce round-off 
errors). 

However, the HF-QMC approach itself has already 
been supplanted in many applications by "continuous 
time" QMC (CTQMC) algorithms ^M^. We conclude 
this paper by addressing the relative strengths of the 
CTQMC technique and the new method presented here. 
CTQMC eliminates the systematic error inherent in HF- 
QMC and DQMC, including the method presented here, 
by stochastically sampling the reducible Feynman graphs 
of the partition function. Although the matrix sizes are 
generally smaller than in HF-QMC, the CTQMC algo- 
rithm also scales like the cube of the inverse tempera- 
ture /? [2§|. So, DQMC is generally much faster than 
CTQMC when applied to finite sized systems [s^ and 
also for the embedded cluster problems presented here, 
especially at low temperatures. However, DQMC has 
the disadvantage of the introduction of systematic error. 
These systematic errors in HF-QMC and DQMC may 
be eliminated by extrapolating the measured quantities 
in the time step squared, At^ — > 34]. Since the val- 
ues of At that are used in this extrapolation are not 
overly small, the linear in /3 nature of the present algo- 
rithm makes for far more efficient calculations, especially 
at lower temperatures. 
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FIG. 6: (Color online) The CPU time required for (a) updat- 
ing and (b) measurement parts of the HF-QMC and DQMC 
{Na = 2) algorithms versus the number of time slices on a 
4x4 cluster. All other quantities are kept constant. The 
lines show power-law fits of the data. The diamond symbols 
show the CPU time in DQMC with a constant At (decreasing 
temperature) where orthogonalization is performed to stabi- 
lize the matrix multiplications. 



CONCLUSIONS 

We have developed a DQMC cluster solver for the 
DMFT, DCA, or CDMFT which scales linearly in the in- 
verse temperature but has the same minus sign problem 
as HF-QMC. Formally, this is accomplished by defining 
an effective Hamiltonian for the embedded-cluster prob- 
lem which includes non- interacting bands for the host. 
The additional Hamiltonian parameters associated with 
the bath bands are adjusted to fit the cluster-host hy- 
bridization function. We prove that when this fit be- 
comes accurate, this DQMC algorithm recovers the same 
average sign as HF-QMC. Using DCA simulations of the 
two-dimensional single-band Hubbard model, we demon- 
strate that as the number of bath bands increases, we 
recover the HF-QMC results, including the average sign. 
The required number of bands is small, increases slightly 
with lowering temperature, and decreases with increasing 
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cluster size. 
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